Overview
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 2293481 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 751.8 MiB |
| Average record size in memory | 343.7 B |
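The average record size follows from the two figures above it. A minimal sketch of that arithmetic, assuming "MiB" means 1024² bytes and that the reported total is rounded to one decimal:

```python
# Values copied from the dataset statistics table.
MIB = 1024 ** 2                 # bytes per MiB (assumed binary definition)
total_bytes = 751.8 * MIB       # "Total size in memory"
n_rows = 2_293_481              # "Number of observations"

# Average bytes per record = total bytes / number of rows.
avg_record_bytes = total_bytes / n_rows
print(f"{avg_record_bytes:.1f} B")
```

The result agrees with the reported 343.7 B to one decimal place.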
Variable types
| DateTime | 1 |
|---|---|
| Numeric | 7 |
| Categorical | 4 |
| Text | 1 |
Alerts
| Alert | Type |
|---|---|
| brand is highly correlated overall with cat1 and 1 other field | High correlation |
| cat1 is highly correlated overall with brand and 1 other field | High correlation |
| cat2 is highly correlated overall with brand and 1 other field | High correlation |
| cust_request_tn is highly correlated overall with customer_id and 2 other fields | High correlation |
| customer_id is highly correlated overall with cust_request_tn and 1 other field | High correlation |
| product_id is highly correlated overall with cust_request_tn and 2 other fields | High correlation |
| sku_size is highly correlated overall with product_id | High correlation |
| tn is highly correlated overall with cust_request_tn and 2 other fields | High correlation |
| plan_precios_cuidados is highly imbalanced (90.5%) | Imbalance |
| cust_request_tn is highly skewed (γ1 = 37.72862987) | Skewed |
| tn is highly skewed (γ1 = 37.9431848) | Skewed |
| stock_final has 1353142 (59.0%) zeros | Zeros |
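The 90.5% imbalance figure for plan_precios_cuidados can be reproduced from the class counts reported for that variable (2265551 zeros, 27930 ones). The sketch below assumes the score is one minus the normalized Shannon entropy of the class distribution. That definition is an assumption, not something the report states, but it matches the reported value.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class distribution and k is the number of classes.
    0 means perfectly balanced, 1 means a single class holds every row."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1 - entropy / math.log2(len(counts))

# Class counts for plan_precios_cuidados, copied from its Common Values table.
counts = [2_265_551, 27_930]
score = imbalance_score(counts)
print(f"{score * 100:.1f}%")
```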
Reproduction
| Analysis started | 2025-06-01 21:14:09.681442 |
|---|---|
| Analysis finished | 2025-06-01 21:15:34.386708 |
| Duration | 1 minute and 24.71 seconds |
| Software version | ydata-profiling v4.16.1 |
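The reported duration is the difference between the two timestamps. A short check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-06-01 21:14:09.681442")
finished = datetime.fromisoformat("2025-06-01 21:15:34.386708")

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)
print(f"{int(minutes)} minute and {seconds:.2f} seconds")
```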
Variables
periodo
Date
| Distinct | 36 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 17.5 MiB |
| Minimum | 2017-01-01 00:00:00 |
| Maximum | 2019-12-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
customer_id
Real number (ℝ)
High correlation 
| Distinct | 597 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10172.667 |
| Minimum | 10001 |
| Maximum | 10637 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.5 MiB |
Quantile statistics
| Minimum | 10001 |
|---|---|
| 5-th percentile | 10007 |
| Q1 | 10055 |
| median | 10135 |
| Q3 | 10269 |
| 95-th percentile | 10448 |
| Maximum | 10637 |
| Range | 636 |
| Interquartile range (IQR) | 214 |
Descriptive statistics
| Standard deviation | 142.26452 |
|---|---|
| Coefficient of variation (CV) | 0.013984978 |
| Kurtosis | -0.2440503 |
| Mean | 10172.667 |
| Median Absolute Deviation (MAD) | 99 |
| Skewness | 0.80407669 |
| Sum | 2.3330817 × 10^10 |
| Variance | 20239.193 |
| Monotonicity | Not monotonic |
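Several of the statistics above are simple functions of the others, which gives a quick consistency check. The sketch below recomputes them for customer_id from the rounded values shown in the tables, so small discrepancies in the last digit are expected.

```python
# Values copied from the customer_id tables (already rounded by the report).
minimum, maximum = 10001, 10637
q1, q3 = 10055, 10269
mean, std = 10172.667, 142.26452
n_rows = 2_293_481

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
total = mean * n_rows             # Sum

print(value_range, iqr)
print(f"CV={cv:.9f} variance={variance:.3f} sum={total:.7e}")
```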
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 10001 | 18327 | 0.8% |
| 10004 | 18254 | 0.8% |
| 10002 | 17544 | 0.8% |
| 10007 | 17522 | 0.8% |
| 10003 | 17440 | 0.8% |
| 10027 | 16710 | 0.7% |
| 10005 | 16601 | 0.7% |
| 10018 | 16553 | 0.7% |
| 10059 | 16058 | 0.7% |
| 10034 | 14884 | 0.6% |
| Other values (587) | 2123588 | 92.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10001 | 18327 | 0.8% |
| 10002 | 17544 | 0.8% |
| 10003 | 17440 | 0.8% |
| 10004 | 18254 | 0.8% |
| 10005 | 16601 | 0.7% |
| 10006 | 14345 | 0.6% |
| 10007 | 17522 | 0.8% |
| 10008 | 7776 | 0.3% |
| 10009 | 13234 | 0.6% |
| 10010 | 8694 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10637 | 2 | < 0.1% |
| 10636 | 3 | < 0.1% |
| 10635 | 41 | < 0.1% |
| 10634 | 15 | < 0.1% |
| 10633 | 2 | < 0.1% |
| 10632 | 2 | < 0.1% |
| 10631 | 16 | < 0.1% |
| 10630 | 43 | < 0.1% |
| 10629 | 6 | < 0.1% |
| 10626 | 148 | < 0.1% |
product_id
Real number (ℝ)
High correlation 
| Distinct | 780 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20389.669 |
| Minimum | 20001 |
| Maximum | 21276 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.5 MiB |
Quantile statistics
| Minimum | 20001 |
|---|---|
| 5-th percentile | 20020 |
| Q1 | 20133 |
| median | 20321 |
| Q3 | 20605 |
| 95-th percentile | 20967 |
| Maximum | 21276 |
| Range | 1275 |
| Interquartile range (IQR) | 472 |
Descriptive statistics
| Standard deviation | 301.12145 |
|---|---|
| Coefficient of variation (CV) | 0.014768334 |
| Kurtosis | -0.38853705 |
| Mean | 20389.669 |
| Median Absolute Deviation (MAD) | 218 |
| Skewness | 0.71355894 |
| Sum | 4.6763319 × 10^10 |
| Variance | 90674.126 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 20111 | 7973 | 0.3% |
| 20122 | 7950 | 0.3% |
| 20120 | 7537 | 0.3% |
| 20326 | 7397 | 0.3% |
| 20132 | 7199 | 0.3% |
| 20004 | 7139 | 0.3% |
| 20276 | 7097 | 0.3% |
| 20058 | 7006 | 0.3% |
| 20027 | 6964 | 0.3% |
| 20013 | 6964 | 0.3% |
| Other values (770) | 2220255 | 96.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20001 | 6172 | 0.3% |
| 20002 | 6000 | 0.3% |
| 20003 | 6793 | 0.3% |
| 20004 | 7139 | 0.3% |
| 20005 | 5911 | 0.3% |
| 20006 | 6497 | 0.3% |
| 20007 | 6906 | 0.3% |
| 20008 | 6453 | 0.3% |
| 20009 | 5596 | 0.2% |
| 20010 | 4611 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 21276 | 64 | < 0.1% |
| 21267 | 67 | < 0.1% |
| 21266 | 94 | < 0.1% |
| 21265 | 93 | < 0.1% |
| 21263 | 130 | < 0.1% |
| 21262 | 122 | < 0.1% |
| 21259 | 128 | < 0.1% |
| 21256 | 115 | < 0.1% |
| 21252 | 67 | < 0.1% |
| 21248 | 120 | < 0.1% |
plan_precios_cuidados
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 109.4 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2265551 | 98.8% |
| 1 | 27930 | 1.2% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2265551 | 98.8% |
| 1 | 27930 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2265551 | 98.8% |
| 1 | 27930 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 2293481 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2265551 | 98.8% |
| 1 | 27930 | 1.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 2293481 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2265551 | 98.8% |
| 1 | 27930 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 2293481 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2265551 | 98.8% |
| 1 | 27930 | 1.2% |
cust_request_qty
Real number (ℝ)
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1707967 |
| Minimum | 1 |
| Maximum | 92 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 7 |
| Maximum | 92 |
| Range | 91 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 3.6562439 |
|---|---|
| Coefficient of variation (CV) | 1.6842866 |
| Kurtosis | 53.295061 |
| Mean | 2.1707967 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.3016896 |
| Sum | 4978681 |
| Variance | 13.368119 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1581270 | 68.9% |
| 2 | 349298 | 15.2% |
| 3 | 117208 | 5.1% |
| 4 | 63774 | 2.8% |
| 5 | 36187 | 1.6% |
| 6 | 24521 | 1.1% |
| 7 | 18116 | 0.8% |
| 8 | 14343 | 0.6% |
| 9 | 10938 | 0.5% |
| 10 | 9153 | 0.4% |
| Other values (74) | 68673 | 3.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 1581270 | 68.9% |
| 2 | 349298 | 15.2% |
| 3 | 117208 | 5.1% |
| 4 | 63774 | 2.8% |
| 5 | 36187 | 1.6% |
| 6 | 24521 | 1.1% |
| 7 | 18116 | 0.8% |
| 8 | 14343 | 0.6% |
| 9 | 10938 | 0.5% |
| 10 | 9153 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 92 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 88 | 1 | < 0.1% |
| 85 | 2 | < 0.1% |
| 84 | 1 | < 0.1% |
| 83 | 1 | < 0.1% |
| 79 | 1 | < 0.1% |
| 78 | 1 | < 0.1% |
| 77 | 1 | < 0.1% |
| 76 | 1 | < 0.1% |
cust_request_tn
Real number (ℝ)
High correlation  Skewed 
| Distinct | 92001 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.50052053 |
| Minimum | 0.0001 |
| Maximum | 551.56137 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.5 MiB |
Quantile statistics
| Minimum | 0.0001 |
|---|---|
| 5-th percentile | 0.00197 |
| Q1 | 0.01027 |
| median | 0.04046 |
| Q3 | 0.16528 |
| 95-th percentile | 1.64433 |
| Maximum | 551.56137 |
| Range | 551.56127 |
| Interquartile range (IQR) | 0.15501 |
Descriptive statistics
| Standard deviation | 3.5233851 |
|---|---|
| Coefficient of variation (CV) | 7.0394417 |
| Kurtosis | 2676.5649 |
| Mean | 0.50052053 |
| Median Absolute Deviation (MAD) | 0.03614 |
| Skewness | 37.72863 |
| Sum | 1147934.3 |
| Variance | 12.414242 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01638 | 13925 | 0.6% |
| 0.00218 | 12909 | 0.6% |
| 0.00983 | 11889 | 0.5% |
| 0.01092 | 10709 | 0.5% |
| 0.00546 | 10651 | 0.5% |
| 0.04095 | 10575 | 0.5% |
| 0.00109 | 10240 | 0.4% |
| 0.03276 | 10179 | 0.4% |
| 0.00491 | 10163 | 0.4% |
| 0.00819 | 9804 | 0.4% |
| Other values (91991) | 2182437 | 95.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0001 | 170 | < 0.1% |
| 0.00013 | 79 | < 0.1% |
| 0.00018 | 159 | < 0.1% |
| 0.0002 | 238 | < 0.1% |
| 0.00021 | 628 | < 0.1% |
| 0.00023 | 744 | < 0.1% |
| 0.00025 | 299 | < 0.1% |
| 0.00026 | 217 | < 0.1% |
| 0.00029 | 137 | < 0.1% |
| 0.0003 | 211 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 551.56137 | 1 | < 0.1% |
| 510.65893 | 1 | < 0.1% |
| 444.41192 | 1 | < 0.1% |
| 439.90647 | 1 | < 0.1% |
| 437.37767 | 1 | < 0.1% |
| 416.64823 | 1 | < 0.1% |
| 407.02225 | 1 | < 0.1% |
| 393.26092 | 1 | < 0.1% |
| 389.02653 | 1 | < 0.1% |
| 384.82574 | 1 | < 0.1% |
tn
Real number (ℝ)
High correlation  Skewed 
| Distinct | 91942 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.48947657 |
| Minimum | 0.0001 |
| Maximum | 547.87849 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.5 MiB |
Quantile statistics
| Minimum | 0.0001 |
|---|---|
| 5-th percentile | 0.00197 |
| Q1 | 0.01027 |
| median | 0.04043 |
| Q3 | 0.16474 |
| 95-th percentile | 1.638 |
| Maximum | 547.87849 |
| Range | 547.87839 |
| Interquartile range (IQR) | 0.15447 |
Descriptive statistics
| Standard deviation | 3.3959859 |
|---|---|
| Coefficient of variation (CV) | 6.9379948 |
| Kurtosis | 2740.3007 |
| Mean | 0.48947657 |
| Median Absolute Deviation (MAD) | 0.03611 |
| Skewness | 37.943185 |
| Sum | 1122605.2 |
| Variance | 11.53272 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.01638 | 13930 | 0.6% |
| 0.00218 | 12908 | 0.6% |
| 0.00983 | 11889 | 0.5% |
| 0.01092 | 10714 | 0.5% |
| 0.00546 | 10653 | 0.5% |
| 0.04095 | 10575 | 0.5% |
| 0.00109 | 10242 | 0.4% |
| 0.03276 | 10193 | 0.4% |
| 0.00491 | 10162 | 0.4% |
| 0.00819 | 9811 | 0.4% |
| Other values (91932) | 2182404 | 95.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.0001 | 170 | < 0.1% |
| 0.00013 | 79 | < 0.1% |
| 0.00018 | 159 | < 0.1% |
| 0.0002 | 238 | < 0.1% |
| 0.00021 | 628 | < 0.1% |
| 0.00023 | 746 | < 0.1% |
| 0.00025 | 299 | < 0.1% |
| 0.00026 | 217 | < 0.1% |
| 0.00029 | 137 | < 0.1% |
| 0.0003 | 211 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 547.87849 | 1 | < 0.1% |
| 469.45761 | 1 | < 0.1% |
| 439.90647 | 1 | < 0.1% |
| 437.37767 | 1 | < 0.1% |
| 430.90803 | 1 | < 0.1% |
| 414.05146 | 1 | < 0.1% |
| 389.02653 | 1 | < 0.1% |
| 386.60688 | 1 | < 0.1% |
| 384.82574 | 1 | < 0.1% |
| 379.4427 | 1 | < 0.1% |
cat1
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 112.8 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.5801932 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HC |
|---|---|
| 2nd row | HC |
| 3rd row | HC |
| 4th row | HC |
| 5th row | HC |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| PC | 1275882 | 55.6% |
| HC | 571463 | 24.9% |
| FOODS | 442263 | 19.3% |
| REF | 3873 | 0.2% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| pc | 1275882 | 55.6% |
| hc | 571463 | 24.9% |
| foods | 442263 | 19.3% |
| ref | 3873 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| C | 1847345 | 31.2% |
| P | 1275882 | 21.6% |
| O | 884526 | 14.9% |
| H | 571463 | 9.7% |
| F | 446136 | 7.5% |
| D | 442263 | 7.5% |
| S | 442263 | 7.5% |
| R | 3873 | 0.1% |
| E | 3873 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Uppercase Letter | 5917624 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| C | 1847345 | 31.2% |
| P | 1275882 | 21.6% |
| O | 884526 | 14.9% |
| H | 571463 | 9.7% |
| F | 446136 | 7.5% |
| D | 442263 | 7.5% |
| S | 442263 | 7.5% |
| R | 3873 | 0.1% |
| E | 3873 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 5917624 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| C | 1847345 | 31.2% |
| P | 1275882 | 21.6% |
| O | 884526 | 14.9% |
| H | 571463 | 9.7% |
| F | 446136 | 7.5% |
| D | 442263 | 7.5% |
| S | 442263 | 7.5% |
| R | 3873 | 0.1% |
| E | 3873 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 5917624 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| C | 1847345 | 31.2% |
| P | 1275882 | 21.6% |
| O | 884526 | 14.9% |
| H | 571463 | 9.7% |
| F | 446136 | 7.5% |
| D | 442263 | 7.5% |
| S | 442263 | 7.5% |
| R | 3873 | 0.1% |
| E | 3873 | 0.1% |
cat2
Categorical
High correlation 
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 123.8 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 7.58106 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | VAJILLA |
|---|---|
| 2nd row | VAJILLA |
| 3rd row | VAJILLA |
| 4th row | VAJILLA |
| 5th row | VAJILLA |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| CABELLO | 610741 | 26.6% |
| DEOS | 430306 | 18.8% |
| SOPAS Y CALDOS | 262523 | 11.4% |
| HOGAR | 198987 | 8.7% |
| ROPA LAVADO | 172212 | 7.5% |
| ADEREZOS | 162986 | 7.1% |
| PIEL2 | 129877 | 5.7% |
| VAJILLA | 121088 | 5.3% |
| PIEL1 | 72077 | 3.1% |
| ROPA ACONDICIONADOR | 61854 | 2.7% |
| Other values (5) | 70830 | 3.1% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| cabello | 610741 | 19.9% |
| deos | 430306 | 14.0% |
| sopas | 262523 | 8.6% |
| y | 262523 | 8.6% |
| caldos | 262523 | 8.6% |
| ropa | 244239 | 8.0% |
| hogar | 198987 | 6.5% |
| lavado | 172212 | 5.6% |
| aderezos | 162986 | 5.3% |
| piel2 | 129877 | 4.2% |
| Other values (8) | 325849 | 10.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| O | 2577885 | 14.8% |
| A | 2512683 | 14.5% |
| L | 2140377 | 12.3% |
| E | 1612876 | 9.3% |
| S | 1414937 | 8.1% |
| D | 1184616 | 6.8% |
| C | 1007145 | 5.8% |
| (space) | 769285 | 4.4% |
| P | 715865 | 4.1% |
| R | 691969 | 4.0% |
| Other values (14) | 2759379 | 15.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Uppercase Letter | 16415778 | 94.4% |
| Space Separator | 769285 | 4.4% |
| Decimal Number | 201954 | 1.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| O | 2577885 | 15.7% |
| A | 2512683 | 15.3% |
| L | 2140377 | 13.0% |
| E | 1612876 | 9.8% |
| S | 1414937 | 8.6% |
| D | 1184616 | 7.2% |
| C | 1007145 | 6.1% |
| P | 715865 | 4.4% |
| R | 691969 | 4.2% |
| B | 610741 | 3.7% |
| Other values (11) | 1946684 | 11.9% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 129877 | 64.3% |
| 1 | 72077 | 35.7% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 769285 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 16415778 | 94.4% |
| Common | 971239 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| O | 2577885 | 15.7% |
| A | 2512683 | 15.3% |
| L | 2140377 | 13.0% |
| E | 1612876 | 9.8% |
| S | 1414937 | 8.6% |
| D | 1184616 | 7.2% |
| C | 1007145 | 6.1% |
| P | 715865 | 4.4% |
| R | 691969 | 4.2% |
| B | 610741 | 3.7% |
| Other values (11) | 1946684 | 11.9% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 769285 | 79.2% |
| 2 | 129877 | 13.4% |
| 1 | 72077 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 17387017 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| O | 2577885 | 14.8% |
| A | 2512683 | 14.5% |
| L | 2140377 | 12.3% |
| E | 1612876 | 9.3% |
| S | 1414937 | 8.1% |
| D | 1184616 | 6.8% |
| C | 1007145 | 5.8% |
| (space) | 769285 | 4.4% |
| P | 715865 | 4.1% |
| R | 691969 | 4.0% |
| Other values (14) | 2759379 | 15.9% |
cat3
Text
| Distinct | 84 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 125.2 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 16 |
| Mean length | 7.7897162 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Cristalino |
|---|---|
| 2nd row | Cristalino |
| 3rd row | Cristalino |
| 4th row | Cristalino |
| 5th row | Cristalino |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| shampoo | 291039 | 10.8% |
| aero | 275284 | 10.2% |
| acondicionador | 241799 | 9.0% |
| sopas | 102850 | 3.8% |
| polvo | 93755 | 3.5% |
| mayonesa | 89442 | 3.3% |
| liquido | 88812 | 3.3% |
| jabon | 84291 | 3.1% |
| noaero | 75253 | 2.8% |
| gel | 71860 | 2.7% |
| Other values (80) | 1278456 | 47.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| o | 1640645 | 9.2% |
| O | 1454152 | 8.1% |
| A | 1359671 | 7.6% |
| a | 1154391 | 6.5% |
| e | 901613 | 5.0% |
| C | 792994 | 4.4% |
| r | 777527 | 4.4% |
| N | 631920 | 3.5% |
| S | 603467 | 3.4% |
| I | 588780 | 3.3% |
| Other values (41) | 7960406 | 44.6% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 8794304 | 49.2% |
| Uppercase Letter | 8671902 | 48.5% |
| Space Separator | 399360 | 2.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| o | 1640645 | 18.7% |
| a | 1154391 | 13.1% |
| e | 901613 | 10.3% |
| r | 777527 | 8.8% |
| l | 586840 | 6.7% |
| s | 556288 | 6.3% |
| i | 508443 | 5.8% |
| n | 450559 | 5.1% |
| u | 337542 | 3.8% |
| d | 310292 | 3.5% |
| Other values (15) | 1570164 | 17.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| O | 1454152 | 16.8% |
| A | 1359671 | 15.7% |
| C | 792994 | 9.1% |
| N | 631920 | 7.3% |
| S | 603467 | 7.0% |
| I | 588780 | 6.8% |
| D | 550293 | 6.3% |
| P | 511109 | 5.9% |
| M | 505849 | 5.8% |
| R | 409804 | 4.7% |
| Other values (15) | 1263863 | 14.6% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 399360 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 17466206 | 97.8% |
| Common | 399360 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| o | 1640645 | 9.4% |
| O | 1454152 | 8.3% |
| A | 1359671 | 7.8% |
| a | 1154391 | 6.6% |
| e | 901613 | 5.2% |
| C | 792994 | 4.5% |
| r | 777527 | 4.5% |
| N | 631920 | 3.6% |
| S | 603467 | 3.5% |
| I | 588780 | 3.4% |
| Other values (40) | 7561046 | 43.3% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 399360 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 17819935 | 99.7% |
| None | 45631 | 0.3% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| o | 1640645 | 9.2% |
| O | 1454152 | 8.2% |
| A | 1359671 | 7.6% |
| a | 1154391 | 6.5% |
| e | 901613 | 5.1% |
| C | 792994 | 4.5% |
| r | 777527 | 4.4% |
| N | 631920 | 3.5% |
| S | 603467 | 3.4% |
| I | 588780 | 3.3% |
| Other values (40) | 7914775 | 44.4% |
None
| Value | Count | Frequency (%) |
|---|---|---|
| ñ | 45631 | 100.0% |
brand
Categorical
High correlation 
| Distinct | 35 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 121.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 6.3189981 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Importado |
|---|---|
| 2nd row | Importado |
| 3rd row | Importado |
| 4th row | Importado |
| 5th row | Importado |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| NIVEA | 281190 | 12.3% |
| DEOS1 | 275302 | 12.0% |
| SHAMPOO3 | 268663 | 11.7% |
| MAGGI | 247823 | 10.8% |
| MUSCULO | 200614 | 8.7% |
| LIMPIEX | 167424 | 7.3% |
| NATURA | 97598 | 4.3% |
| SHAMPOO2 | 90845 | 4.0% |
| SHAMPOO1 | 81904 | 3.6% |
| COLBERT | 66598 | 2.9% |
| Other values (25) | 515520 | 22.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| nivea | 281190 | 12.3% |
| deos1 | 275302 | 12.0% |
| shampoo3 | 268663 | 11.7% |
| maggi | 247823 | 10.8% |
| musculo | 200614 | 8.7% |
| limpiex | 167424 | 7.3% |
| natura | 97598 | 4.3% |
| shampoo2 | 90845 | 4.0% |
| shampoo1 | 81904 | 3.6% |
| colbert | 66598 | 2.9% |
| Other values (25) | 515520 | 22.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| O | 1750437 | 12.1% |
| A | 1596947 | 11.0% |
| M | 1223579 | 8.4% |
| S | 1096586 | 7.6% |
| I | 1089783 | 7.5% |
| E | 1079484 | 7.4% |
| P | 708908 | 4.9% |
| L | 598972 | 4.1% |
| G | 570968 | 3.9% |
| N | 569724 | 3.9% |
| Other values (25) | 4207114 | 29.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Uppercase Letter | 13490582 | 93.1% |
| Decimal Number | 886416 | 6.1% |
| Lowercase Letter | 115504 | 0.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| O | 1750437 | 13.0% |
| A | 1596947 | 11.8% |
| M | 1223579 | 9.1% |
| S | 1096586 | 8.1% |
| I | 1089783 | 8.1% |
| E | 1079484 | 8.0% |
| P | 708908 | 5.3% |
| L | 598972 | 4.4% |
| G | 570968 | 4.2% |
| N | 569724 | 4.2% |
| Other values (15) | 3205194 | 23.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| o | 28876 | 25.0% |
| m | 14438 | 12.5% |
| p | 14438 | 12.5% |
| r | 14438 | 12.5% |
| t | 14438 | 12.5% |
| a | 14438 | 12.5% |
| d | 14438 | 12.5% |
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 432748 | 48.8% |
| 3 | 324153 | 36.6% |
| 2 | 129515 | 14.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 13606086 | 93.9% |
| Common | 886416 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| O | 1750437 | 12.9% |
| A | 1596947 | 11.7% |
| M | 1223579 | 9.0% |
| S | 1096586 | 8.1% |
| I | 1089783 | 8.0% |
| E | 1079484 | 7.9% |
| P | 708908 | 5.2% |
| L | 598972 | 4.4% |
| G | 570968 | 4.2% |
| N | 569724 | 4.2% |
| Other values (22) | 3320698 | 24.4% |
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 432748 | 48.8% |
| 3 | 324153 | 36.6% |
| 2 | 129515 | 14.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 14492502 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| O | 1750437 | 12.1% |
| A | 1596947 | 11.0% |
| M | 1223579 | 8.4% |
| S | 1096586 | 7.6% |
| I | 1089783 | 7.5% |
| E | 1079484 | 7.4% |
| P | 708908 | 4.9% |
| L | 598972 | 4.1% |
| G | 570968 | 3.9% |
| N | 569724 | 3.9% |
| Other values (25) | 4207114 | 29.0% |
sku_size
Real number (ℝ)
High correlation 
| Distinct | 67 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 415.88843 |
| Minimum | 1 |
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 90 |
| median | 220 |
| Q3 | 450 |
| 95-th percentile | 1000 |
| Maximum | 10000 |
| Range | 9999 |
| Interquartile range (IQR) | 360 |
Descriptive statistics
| Standard deviation | 677.77956 |
|---|---|
| Coefficient of variation (CV) | 1.6297149 |
| Kurtosis | 46.201652 |
| Mean | 415.88843 |
| Median Absolute Deviation (MAD) | 170 |
| Skewness | 5.338533 |
| Sum | 9.5383221 × 10^8 |
| Variance | 459385.14 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 200 | 223429 | 9.7% |
| 90 | 156578 | 6.8% |
| 400 | 148808 | 6.5% |
| 50 | 136509 | 6.0% |
| 350 | 123263 | 5.4% |
| 10 | 107218 | 4.7% |
| 750 | 103162 | 4.5% |
| 100 | 95921 | 4.2% |
| 300 | 78059 | 3.4% |
| 3000 | 70275 | 3.1% |
| Other values (57) | 1050259 | 45.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 13408 | 0.6% |
| 2 | 16616 | 0.7% |
| 3 | 3010 | 0.1% |
| 4 | 19199 | 0.8% |
| 5 | 43520 | 1.9% |
| 6 | 14249 | 0.6% |
| 8 | 13526 | 0.6% |
| 10 | 107218 | 4.7% |
| 12 | 21712 | 0.9% |
| 15 | 26654 | 1.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10000 | 2017 | 0.1% |
| 5000 | 6858 | 0.3% |
| 4000 | 3500 | 0.2% |
| 3000 | 70275 | 3.1% |
| 2000 | 3319 | 0.1% |
| 1400 | 3906 | 0.2% |
| 1250 | 3478 | 0.2% |
| 1000 | 24562 | 1.1% |
| 950 | 12883 | 0.6% |
| 930 | 60950 | 2.7% |
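The "Frequency (%)" column in these value tables is the row count divided by the 2293481 observations. The sketch below uses a hypothetical helper, `format_frequency`, whose rounding rule is an assumption: a positive share that rounds to 0.000 at three decimals is shown as "< 0.1%". That rule agrees with the rows in this report (for example, 2017 rows display as 0.1% while 46 rows display as < 0.1%), but the report does not state it.

```python
N_ROWS = 2_293_481  # "Number of observations" from the overview

def format_frequency(count, total=N_ROWS):
    """Format count / total the way the value tables appear to
    (assumed rule, inferred from the rows in this report)."""
    share = count / total
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    return f"{share * 100:.1f}%"

for count in (442263, 27930, 2017, 46):
    print(count, format_frequency(count))
```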
stock_final
Real number (ℝ)
Zeros 
| Distinct | 10047 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.685998 |
| Minimum | -13.66656 |
| Maximum | 1562.0245 |
| Zeros | 1353142 |
| Zeros (%) | 59.0% |
| Negative | 23411 |
| Negative (%) | 1.0% |
| Memory size | 17.5 MiB |
Quantile statistics
| Minimum | -13.66656 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 4.82313 |
| 95-th percentile | 52.07557 |
| Maximum | 1562.0245 |
| Range | 1575.691 |
| Interquartile range (IQR) | 4.82313 |
Descriptive statistics
| Standard deviation | 52.754478 |
|---|---|
| Coefficient of variation (CV) | 4.5143323 |
| Kurtosis | 247.31626 |
| Mean | 11.685998 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.319384 |
| Sum | 26801614 |
| Variance | 2783.0349 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1353142 | 59.0% |
| 0.049 | 727 | < 0.1% |
| 3.42342 | 468 | < 0.1% |
| 0.01327 | 394 | < 0.1% |
| 7.17084 | 391 | < 0.1% |
| 0.11394 | 367 | < 0.1% |
| 0.7204 | 355 | < 0.1% |
| 0.4368 | 341 | < 0.1% |
| 10.49925 | 330 | < 0.1% |
| 0.04423 | 327 | < 0.1% |
| Other values (10037) | 936639 | 40.8% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -13.66656 | 65 | < 0.1% |
| -13.33127 | 196 | < 0.1% |
| -8.19961 | 64 | < 0.1% |
| -8.15986 | 86 | < 0.1% |
| -7.7212 | 24 | < 0.1% |
| -5.86579 | 65 | < 0.1% |
| -5.28091 | 94 | < 0.1% |
| -5.0992 | 51 | < 0.1% |
| -4.87775 | 74 | < 0.1% |
| -4.44673 | 130 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1562.02448 | 221 | < 0.1% |
| 1284.38214 | 158 | < 0.1% |
| 1212.36734 | 158 | < 0.1% |
| 1146.09799 | 213 | < 0.1% |
| 1097.55623 | 149 | < 0.1% |
| 1057.38804 | 189 | < 0.1% |
| 1037.85386 | 186 | < 0.1% |
| 1031.01561 | 176 | < 0.1% |
| 978.16446 | 46 | < 0.1% |
| 916.3419 | 215 | < 0.1% |
Interactions
Correlations
| | brand | cat1 | cat2 | cust_request_qty | cust_request_tn | customer_id | plan_precios_cuidados | product_id | sku_size | stock_final | tn |
|---|---|---|---|---|---|---|---|---|---|---|---|
| brand | 1.000 | 1.000 | 0.832 | 0.012 | 0.023 | 0.039 | 0.237 | 0.369 | 0.253 | 0.098 | 0.023 |
| cat1 | 1.000 | 1.000 | 1.000 | 0.016 | 0.019 | 0.054 | 0.040 | 0.329 | 0.206 | 0.078 | 0.018 |
| cat2 | 0.832 | 1.000 | 1.000 | 0.012 | 0.017 | 0.038 | 0.121 | 0.285 | 0.419 | 0.076 | 0.017 |
| cust_request_qty | 0.012 | 0.016 | 0.012 | 1.000 | 0.378 | -0.451 | 0.003 | -0.006 | 0.011 | -0.010 | 0.379 |
| cust_request_tn | 0.023 | 0.019 | 0.017 | 0.378 | 1.000 | -0.514 | 0.000 | -0.605 | 0.479 | 0.033 | 1.000 |
| customer_id | 0.039 | 0.054 | 0.038 | -0.451 | -0.514 | 1.000 | 0.006 | -0.009 | -0.027 | -0.018 | -0.514 |
| plan_precios_cuidados | 0.237 | 0.040 | 0.121 | 0.003 | 0.000 | 0.006 | 1.000 | 0.076 | 0.016 | 0.008 | 0.000 |
| product_id | 0.369 | 0.329 | 0.285 | -0.006 | -0.605 | -0.009 | 0.076 | 1.000 | -0.598 | -0.036 | -0.605 |
| sku_size | 0.253 | 0.206 | 0.419 | 0.011 | 0.479 | -0.027 | 0.016 | -0.598 | 1.000 | 0.083 | 0.479 |
| stock_final | 0.098 | 0.078 | 0.076 | -0.010 | 0.033 | -0.018 | 0.008 | -0.036 | 0.083 | 1.000 | 0.033 |
| tn | 0.023 | 0.018 | 0.017 | 0.379 | 1.000 | -0.514 | 0.000 | -0.605 | 0.479 | 0.033 | 1.000 |
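The "High correlation" alerts can be cross-checked against this matrix. The sketch below assumes a cut-off of 0.5 on the absolute coefficient. That threshold is an assumption rather than something the report states, but with it the flagged partners match the alert wording for all eight variables.

```python
# Column order and coefficients copied from the correlation table.
names = ["brand", "cat1", "cat2", "cust_request_qty", "cust_request_tn",
         "customer_id", "plan_precios_cuidados", "product_id", "sku_size",
         "stock_final", "tn"]
matrix = [
    [1.000, 1.000, 0.832, 0.012, 0.023, 0.039, 0.237, 0.369, 0.253, 0.098, 0.023],
    [1.000, 1.000, 1.000, 0.016, 0.019, 0.054, 0.040, 0.329, 0.206, 0.078, 0.018],
    [0.832, 1.000, 1.000, 0.012, 0.017, 0.038, 0.121, 0.285, 0.419, 0.076, 0.017],
    [0.012, 0.016, 0.012, 1.000, 0.378, -0.451, 0.003, -0.006, 0.011, -0.010, 0.379],
    [0.023, 0.019, 0.017, 0.378, 1.000, -0.514, 0.000, -0.605, 0.479, 0.033, 1.000],
    [0.039, 0.054, 0.038, -0.451, -0.514, 1.000, 0.006, -0.009, -0.027, -0.018, -0.514],
    [0.237, 0.040, 0.121, 0.003, 0.000, 0.006, 1.000, 0.076, 0.016, 0.008, 0.000],
    [0.369, 0.329, 0.285, -0.006, -0.605, -0.009, 0.076, 1.000, -0.598, -0.036, -0.605],
    [0.253, 0.206, 0.419, 0.011, 0.479, -0.027, 0.016, -0.598, 1.000, 0.083, 0.479],
    [0.098, 0.078, 0.076, -0.010, 0.033, -0.018, 0.008, -0.036, 0.083, 1.000, 0.033],
    [0.023, 0.018, 0.017, 0.379, 1.000, -0.514, 0.000, -0.605, 0.479, 0.033, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off for a "High correlation" alert

def highly_correlated(names, matrix, threshold=THRESHOLD):
    """Map each variable to the other variables whose |coefficient| >= threshold."""
    result = {}
    for name, row in zip(names, matrix):
        partners = [other for other, r in zip(names, row)
                    if other != name and abs(r) >= threshold]
        if partners:
            result[name] = partners
    return result

high = highly_correlated(names, matrix)
for name, partners in high.items():
    print(f"{name}: {', '.join(partners)}")
```

Note that tn and cust_request_tn correlate at 1.000, so the two columns carry nearly the same information.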
Missing values
Sample
First rows
| | periodo | customer_id | product_id | plan_precios_cuidados | cust_request_qty | cust_request_tn | tn | cat1 | cat2 | cat3 | brand | sku_size | stock_final |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2017-01-01 | 10234 | 20524 | 0 | 2 | 0.05300 | 0.05300 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 1 | 2017-01-01 | 10032 | 20524 | 0 | 1 | 0.13628 | 0.13628 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 2 | 2017-01-01 | 10217 | 20524 | 0 | 1 | 0.03028 | 0.03028 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 3 | 2017-01-01 | 10125 | 20524 | 0 | 1 | 0.02271 | 0.02271 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 4 | 2017-01-01 | 10012 | 20524 | 0 | 11 | 1.54452 | 1.54452 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 5 | 2017-01-01 | 10080 | 20524 | 0 | 1 | 0.01514 | 0.01514 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 6 | 2017-01-01 | 10015 | 20524 | 0 | 4 | 0.10600 | 0.10600 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 7 | 2017-01-01 | 10062 | 20524 | 0 | 1 | 0.18928 | 0.18928 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 8 | 2017-01-01 | 10159 | 20524 | 0 | 3 | 0.02271 | 0.02271 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
| 9 | 2017-01-01 | 10183 | 20524 | 0 | 1 | 0.01514 | 0.01514 | HC | VAJILLA | Cristalino | Importado | 500.0 | 0.0 |
Last rows
| | periodo | customer_id | product_id | plan_precios_cuidados | cust_request_qty | cust_request_tn | tn | cat1 | cat2 | cat3 | brand | sku_size | stock_final |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2293471 | 2019-12-01 | 10021 | 20853 | 0 | 8 | 0.15829 | 0.15829 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293472 | 2019-12-01 | 10093 | 20853 | 0 | 1 | 0.05574 | 0.05574 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293473 | 2019-12-01 | 10003 | 20853 | 0 | 9 | 0.62426 | 0.62426 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293474 | 2019-12-01 | 10367 | 20853 | 0 | 1 | 0.00446 | 0.00446 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293475 | 2019-12-01 | 10278 | 20853 | 0 | 5 | 0.06020 | 0.06020 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293476 | 2019-12-01 | 10105 | 20853 | 0 | 1 | 0.02230 | 0.02230 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293477 | 2019-12-01 | 10092 | 20853 | 0 | 1 | 0.00669 | 0.00669 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293478 | 2019-12-01 | 10006 | 20853 | 0 | 7 | 0.02898 | 0.02898 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293479 | 2019-12-01 | 10018 | 20853 | 0 | 4 | 0.01561 | 0.01561 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |
| 2293480 | 2019-12-01 | 10020 | 20853 | 0 | 2 | 0.01561 | 0.01561 | PC | CABELLO | Shampoo Bebe | NIVEA | 200.0 | 1.82373 |